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Abstract 

It is hard to exaggerate the role of economic aggregators — functions that summarize numerous 
and / or heterogeneous data — in economic models since the early XX*'' century. In many cases, as 
"witnessed by the pioneering works of Cobb and Douglas, these functions were information quantities 
tailored to economic theories, i.e. they were built to fit economic phenomena. In this paper, we 
look at these functions from the complementary side: information. We use a recent toolbox built 
on top of a vast class of distortions coined by Bregman, whose application field rivals metrics' in 
various subfields of mathematics. This toolbox makes it possible to find the quality of an aggregator 
(for consumptions, prices, labor, capital, wages, etc.), from the standpoint of the information it 
carries. We prove a rather striking result. From the informational standpoint, well-known economic 
aggregators do belong to the optimal set. As common economic assumptions enter the analysis, 
this large set shrinks, and it essentially ends up exactly fitting either CES, or Cobb-Douglas, or 
both. To summarize, in the relevant economic contexts, one could not have crafted better some 
aggregator from the information standpoint. We also discuss global economic behaviors of optimal 
information aggregators in general, and present a brief panorama of the links between economic 
and information aggregators. 

Keywords : Economic Aggregators, CES, Cobb-Douglas, Bregman divergences 



1 Introduction 

Since the end of the XIX*'^ century and the birth of the "neo-classical" school, mathematics have 
played a growing role in economics. With the works of Leon Walras, the question of aggregation of 
the behavior of many individuals has risen and become central in the economic theory. In order to 
represent as well as possible the evolution of these aggregate variables, some mathematical functions 
have been proposed and become very famous in the economic literature. 
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One of the most famous neo-classical function is the Cobb-Douglas [3 [19]. This function is of 
particular interest, since it allows for perfect substitutability between the different inputs it depends 
on. Another well-known "linear" function was later formulated by Leontief [13], in which inputs are 
conversely complementary. The choice of such a function to describe the production process has very 
strong implications at the macroeconomic level, as illustrated by many results found by Keynesians 
economics in the literature on growth theory. 

But beyond these different aggregate functions, one of the most recently built and well-known one 
is the constant elasticity of substitution (CES) function elaborated by Arrow et al. [2]. Indeed, in 
the Cobb-Douglas production function, the elasticity of substitution of capital for labor is fixed to 
unity. This implies that a one percent increase in the capital stock implies an equal one percent fall 
in labor inputs in order to maintain a constant production level, given the structure of relative prices. 
On the contrary, the CES function allows this elasticity to lie between zero and infinity, but to stay 
fixed at that number along and across the isoquants, whatever the quantities of inputs that are used 
in the production process. The main advantage exhibited by the CES function is that it encompasses 
the Cobb-Douglas, the Leontief and the Linear production functions, which are in fact limit and thus 
particular cases of it. Nevertheless, one of the reasons economists have kept on using simpler functions 
such as the Cobb-Douglas one is the heavy calculus to which the CES function often leads, especially 
at the point where models have to be closed. 

In a seminal work, Douglas in [lOj highlights the importance of the progresses in the field of 
statistical information in the genesis of his essay. Pioneering works of Cobb and Douglas [7], and 
Arrow et al. [2], underline the inductive nature of the inception of their respective functions, as 
the purpose was to fit as best as possible information quantities (aggregators) to observed economic 
phenomena. In this paper, we take a deductive route paved with a rigorous information material, to 
derive these fundamental quantities based on two assumptions: 

• an aggregator should always be as informative as possible with respect to the data it summarizes 
(prices, consumptions, wages, capital, labor, etc.); 

• an aggregator might be require to satisfy standard economic assumptions, relying on aggregator 
dualities (prices / consumptions, wages / labor, etc.), elasticities, marginal rates of substitutions, 
returns to scale, etc. 

The starting point of our work is a class of distortions coined in the sixties by Bregman [^, in the 
context of convex programming. Though they were born four decades ago, it was only much later 
that these distortions literally spread out to other fields, including statistics, signal processing and 
classification |12j . fields where they had to become undeniably central. It was even later that was 
discovered their broad applicability, with an axiomatization that makes it possible to relate them 
to metrics and their spawns [3]. Very roughly, Bregman divergences are non-negative functions that 
meet the same identity of indiscernibles condition as metrics, and rely on a third assumption about 
the existence of a particular aggregator which minimizes the total distortion to a set. This last 
condition, which can be rephrased as a maximum likelihood condition, makes this aggregator the 
most informative quantity about the data, and we call it a Low Distortion Aggregator (LDA). 

In this paper, our contribution is threefold. First, we make a clear partition of economic aggregators 
with respect to information, as we show that some are LDAs (CES, Cobb-Douglas), some are limit 
cases of LDAs (Leontief), and some are neither (Mitscherlich-Spillman-von Thiinen). Without more 
assumptions, the set of all LDAs is huge, yet we show that global trends of economic relevance can be 
easily shown for all, such as on marginal rates of substitution, and the set can be quite easily drilled 
down for aggregators with general behaviors, such as concavity or convexity. This, in fact, is our last 
contribution. Our main contribution is to show that, when we plug in various standard economic 
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assumptions (see above), the set of all LDAs reduces to a particular subset which precisely matches 
CES, Cobb-Douglas, or both sets. This novel advocacy for the use of these popular aggregators brings 
a very strong information-theoretic rationale to their "economic" existence. 

The remaining of the paper is structured as follows. Section 2 presents LDAs and their main 
properties. In Section 3, we relate common economic aggregators to LDAs. Section 4 discusses 
additional properties of LDAs. A last section concludes the paper, with avenues for future research. 
In order not to laden the paper's body, all proofs have been postponed to an appendix. 

2 Low-distortion aggregators 

For any strictly convex function 93 : X — > R differentiable on int(X), with X C R'^ convex, the Bregman 
Divergence with generator ip \s [H [3] : 

def 

D^{x\\y) = ip{x) - ip{y) - {x -y,V^{y)) , (1) 

where (•,•) denotes the inner product, and V,^ *== [dip/dxi]^ is the gradient operator. In this paper, 
bold notations such as x shall denote vector-based notations, and blackboard faces such as X sets of 
(tuples of) real numbers or natural integers of M or N respectively. D^{x\\y) is the difference between 
the value oi (p at x and the value at x of the hyperplane tangent to p in y. Bregman divergences 
encode a natural notion of distortion, as shown by Theorem [1] below. Its proof is a slight variation of 
Theorem 4 in [3] (see also [1]). 

Theorem 1 Let F : R^xR'^ ^Rbe a function that satisfies the following three axioms 

1. non-negativity: F{x,y) > 0; 

2. identity of indiscernibles: F{x,y) =0 if and only if x = y; 

3. the expectation is the lowest distortion's predictor: for any random variable X whose distribution 
T> has support R'^, 

argygRdmin£;x,F(X,y) = E'dX(^= ^j.x) , (2) 

where Ex> denotes the mathematical expectation. 

Then F{x,y) = D^{x\\y) for some strictly convex and differentiable if : R'^ — > R. 

(proof: see the Appendix) It is easy to check that any Bregman divergence satisfies [1], [2] and [3] [1], 
and so Theorem [T] provides a complete characterization of Bregman divergences, in the same way as 
conditions [1] and [2], completed with symmetry and subadditivity, would axiomatize a metric. This 
positions Bregman divergences with respect to numerous metric-related notions, and gives the impor- 
tance of their main difference, eq. ([2]). Eq. ([2]) is fundamental because it says that the (arithmetic) 
expectation is the lowest distortion parameter for a population, regardless of the distortion. Actually, 
eq. ^ says much more: the expectation is maximum likelihood estimator of data for a large set of 
distributions called the exponential families. These families contain some of the most popular distri- 
butions, such as Bernoulli, multinomial, beta, gamma, normal, Rayleigh, Laplacian, Poisson [H I15j. 
A remarkable property is that any member satisfies the following identity [1]: 

logPr[x\0,Lp] = -D^{x\\fie) + logb^{x) . (3) 
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dom((^) 


(i>(x) 


D,„{x\\ii) 


Divergence name 






y^.-i ixi — Ui)^ 


Squared Euclidean norm 


R^ 


X log X — X 


Ya=1 log Yi-^i + Vi 


Kullback-Leibler div. 


Pd 


id. 




Entropy 


Rf 


— log X 


Vf 1 ^ - log 2^ - 1 


Itakura-Saito div. 


[0,1] 


X log a; 
+ (1 — x) log(l — x) 


xlog 1 
+ (l-x)logi^ 


Logistic loss 



Table 1: Correspondence between various generators and their Bregman divergences. is the d- 

def 

dimensional probability simplex. The generator of the Bregman divergence is defined by ^p{x) = 
J2i=i (t>{xi) (see text). 

6 defines the so-called natural parameters of the distribution, and is a normalization function. It 
follows from ([3]) and ([2]) that the maximum likelihood estimator of data is the expectation parameter 
Me- 

Some Bregman divergences have become cornerstones of various fields of mathematics and com- 
puter science, as shown in Tabled! All of them are separable Bregman divergences [8], as they can be 
characterized using a strictly convex function : I C R ^ R, the generator of the Bregman divergence 
being just: 

^{x) j;<^(x.) . (4) 

i 

It might seem that ([2]) unveils a strong assymetry between the two parameters of a Bregman 
divergence, all the more as that Bregman divergences are not symmetric in almost all cases p^. This 
distinction becomes more superficial — but crucial for our purpose — as Legendre duality enters the 
analysis. Any Bregman divergence is indeed equal to a Bregman divergence over swapped parameters 
in the generator's gradient space. To make it formal, the generator (/9 of a Bregman divergence admits 
a convex conjugate : R'^ — s- R given by |17j : 

ip*{y) =^ sup{{x,y) - ip{x)} (5) 

= {y,V;^\y))-^{V;^\y)) , (6) 

where V~^, the inverse gradient, is well-defined because of the strict convexity of The follow- 
ing Theorem, whose proof follows from plugging ([6]) in ([T]), states the dual symmetry of Bregman 
divergences. 

Theorem 2 D^{x\\y) = D^*{V^{y)\\V^{x)). 

It follows from ([2]) and the strict convexity of (p that the minimizer of the expected dual divergence 
can be expressed in int(dom((/3)) as: 

/.^ II' V^HEi,V^(X)) . (7) 

The set spanned by ([7]), which includes the arithmetic average (take in dH)), is close to the 

set of /-means [TT], a set whose studies date back to the early thirties, by Kolmogorov and Nagumo. 

To summarize the conceptual justifications for the use of aggregators having shape ([7]), three main 
motivations could justify their use: first, they are all optimal distortion estimators — and the only 
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ones to be optimal - in the sense of Theorem [TJ second, they encode maximum hkehhood estimators 
for a majority of popular distributions; third, they encode geodesic-like curves in the geometry of the 
information space |15j . For all these reasons, they can be considered the best information aggregators 
for the data they summarize (data which could be prices, consumptions, labors, wages, etc. in the 
economic world). 

Hereafter, we consider averages ([7]) with finite support of size m > 0, and replace ([2|) by the 
more general search for arg^^gjgd min ^™ 7jF(a;j, y), with 7^ > 0,Vz = l,2,...,m. A rapid glimpse at 
([2]) reveals that the solution is Yl^i'li-'^i^ extension of ([7]) to the minimizer of a general 

weighted sum of Bregman divergences now takes the more general form: 

= rv-i(^i^7.v^(a..)^ , (8) 

with r =^ YliLili- Because of ([2]), any as in ([8]) is called a low-distortion aggregator (LDA). For 
economic and mathematical reasons, averages having the form ([8]) with a concave or convex regime 
are particularly interesting. The following Theorem allows to catch the picture of where concavity 
and convexity lie: the symmetric dual average of some average ([8]) enjoys the symmetric regime. If 
one is concave, the other is convex and vice versa. 

Theorem 3 fj,^ is concave if and only if fj,^* is convex. 

(proof: see the Appendix). To finish up with information, we state the last result that shall be useful 
in the sequel. 

Theorem 4 Let 1^1^ a concave (resp. convex) average that follows Then it is upperbounded (resp. 
lowerbounded) by the sum: s = "^{Lili^i- Furthermore, V,^ is concave (resp. convex), and V""*^ is 
convex (resp. concave). 

(proof: see the Appendix). 



3 Economic Aggregators 

Because a LDA does not change by adding a constant term to its generator (p, it should be kept in 
mind that generators shall be given up to any such constant. Furthermore, our analysis takes place 
for separable generators, that meet This eases readability while encompassing most economic 
settings. For such reasons, it is also convenient to assume that dom0 C M_(., and introduce the 
following notation for any relevant /c € N*: 

^f'^'(x) ^ . (9) 



3.1 Optimality of Economic Aggregators 

Let X* denote an aggregator for values xi, X2, Xm {m € N*). One of the most common economic 
aggregators is the CES function [2]: 

Y.^^^^) ■ (10) 

The formulation in a is not the simplest but it is intentional, as it depicts the constant elasticity of 
substitution inside values aggregated [5]. Here, /3j > is the weight of aggregated value Xj. Further 
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constraints of economic relevance are generally imposed on a depending on the setting in which (jlOp 
is applied [5]; in order to remain as general as possible, we consider the unrestricted setting for which 
(T G M*\{1}. We now show that a CES is a LDA. 

Lemma 1 Any CES as defined in is a LDA for the generator 

, , \ def 2—— /•-! -I N 

<PCEs(2;) = ax - , (11) 
with a G M* any constant for which is convex, and -ji ^= /3jB"^,Vi = l,2,...,m. Here, B =^ 

(proof: see the Appendix). Aggregators are sometimes tied up via important economic equalities. One 
example relates prices and consumptions. Let m denote the number of goods, and the consumption 
function of good i is noted Cj. The price of good i is pi. Two aggregators for consumptions and prices, 
respectively c^ and are devised so as to satisfy: 

m 

ciPi = Pi^Ci, . (12) 

j=i 

Further economic assumptions can be made, such as the concavity of c^,, which indicates the preference 
for diversity [U]. The popular choice for is a CES function pO|) [2\. Notice that the weights in the 
LDA (7,) are different from the weights in the CES (/3j). Modulo a simple normalization of the CES, 
they remain equal. If we multiply by the normalized CES obtained is such that ■ji = Pi- 

Furthermore, this normalization, for which = m^^^^~"^ when all f3i = 1, is one which turns 

out to play a key role in economic models [5]. 

The price aggregator, p^, can be found by inspecting (jl2p after remarking that partial derivatives 
on the left and right-hand side must also coincide. After a standard derivation using (jlOp for c^, we 
obtain that the price index has the form: 



P-^ has also the general CES form of (jlOp : for completeness, we characterize below its LDA (proof 
similar to Lemma [1]). 

Lemma 2 Let 5i =^ (3^, and A =^ Y^^i ^i- The price index in [T3\) is a LDA for the generator: 
with 6 E M* any constant for which is convex, and '= 6iA^^ ,yi = 1,2, ...,m. 



Modulo the normalization of the CES for c*, and the choice /3j = 1, (jl3p would return to the conven- 
tional choice in which ^ 1 /m. It is quite a remarkable fact that and are LDA under the sole 
assumptions of (fT2]) and is a CES. Such a property also holds for labor and wages. Suppose we have 
n consumer-workers, each of which selling a particular labor type; let wj be the wage for labor-type j 
and Uj the demand for labor-type j, for j = l,2,...,n. Then there exists an aggregate labor-demand 
index n*, and a wage index w^,, such that 0: 



n 



'^vjiUi = Wi^Ui, . (15) 
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The CES form for w^, [5] implies both the LDA property for and (Lemmata [T] and [2]) . To 
summarize, popular aggregators for consumptions, prices, labor and wages are all LDAs, which means 
that they are all optimal from the information theory standpoint. Before drilling down further into 
the properties that yield relationships like (jl2p or (jlSp , let us give a brief panorama of which Bregman 
divergences are involved so far. 

The Bregman divergence of a CES (jlOp is: 

D^^^^ix\\z) = a^|x;"^-(^2-i)xiz'"^ + (^l-i)zr^} • (16) 

Since any CES is a LDA, it follows that Cobb-Douglas and Leontief functions are limit LDAs, re- 
spectively when fj — > 1 and a —>■ 0^. While Leontief function, x^, '= minj{/3jXi}, does not admit a 
generator (it is not differentiable) , Cobb-Douglas, 

m 
i=l 

admits one, which is: 

4'cd{x) *== 6(xlogx — x) (18) 
= HKhix) 

(see Table [H b G is any constant). If we look at the price index in ()13p . we get the following 
result. 

Lemma 3 Fix b = l/((2 — cr)(l — a)) in ( f j^D , assuming a 1, and let cftis *== — logx (see TableU^). 
Then 

lim^^p(^ll^) = ^m(aj||2) • (19) 

This result is easily proven once we remark that x^ ~ 1 + fclogx + o{k). The right-hand side of 
()19p is Itakura-Saito divergence (Table [1]). Together with the fact that the limit divergence for is 
Kullback-Leibler divergence when o" — > 1, we get the generators for two popular divergences of signal 
processing and statistics [15]. A well-known similar result holds for a particular subset of Bregman 
divergences, Amari a-divergences, for which [1]: 

0a(x) = 4(^x-x^)/(l-a2),aG [-1,1] . (20) 

Taking limits of the generator when a reaches the interval bounds yields Itakura-Saito and Kullback- 
Leibler divergences. 

3.2 Completeness of Economic Aggregators 

In this section, we consider some relevant economic assumptions about aggregators, and show that any 
LDA that would meet such assumptions would necessarily belong to a particular subclass of LDAs. 
This subclass is called "complete" for the assumption at hand. 

The first assumption we consider is about any two dual aggregators x^ (for xi, X2, Xm) and 
(for zi, Z2, Zm) that would meet the following abstraction of (fT2|) and (fT5]) : 

m 

'^XiZi = x^z^ . (21) 
i=l 
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We show that CES turns out to be complete for dual aggregators, as the LDA assumption for any of 
the two implies that both are CES. We state it more formally below. 



Theorem 5 Suppose that at least one of x^, and that satisfies \21\) is a LDA. Then both x^, and z, 

[21 / [2l\ 

are CES. Furthermore, they are linked through the identity cpz = d[(j)x ) for some d € . 



(proof: see the Appendix). CES turns out to be complete from another standpoint: elasticities. 
Consider some LDA Xj,; its elasticity with respect to Xi {i = 1, 2, m) is defined as: 

^ " (t^)/(t)- 

Consider the economic assumption that all elasticities sum to one. We show that CES is complete for 
this assumption. 

Theorem 6 Let X2, x^) be any LDA. Then Yl^i^x'' — 1 if and only if x^ is a CES. 

(proof: see the Appendix). We now switch to another important economic quantity, the substitution 
elasticity of Xi for Xj in x^,, E^l^ \ defined by: 



where 



is the marginal rate of substitution of Xj for Xj. Another economic assumption commonly encountered 
is the fact that E^l^ ^ is assumed to be unit. We show that the complete LDA subclass for this 
assumption is, this time, Cobb-Douglas. 

Theorem 7 Let x^,{xi,X2, ...jXm) be any LDA. Then, there exists indices 1 < i,j < m such that 
^x^i-^xj _ gj^^ gjj^iy is a Cobb-Douglas. 

(proof: see the Appendix). It is interesting to notice that the LDA assumption competes with the 
homogeneity assumptions about x^ that are required to come up with the same result (i.e. without 
making the LDA assumption) . The fact that we are able to alleviate the economic setting (homogeneity 
ties up x^ with assumptions on returns to scale) while ending up with the same aggregator makes 
information a very valuable companion to introduce the true nature of popular economic aggregators. 
One question which remains is however what would imply the homogeneity assumption alone in a 
LDA setting. We define x* to be homogeneous of degree a G M* if and only if: 

x^(Axi, AX2, AXm) = A''x^(xi,X2, ...,Xm) , (25) 

for every A E M4-. We show that the complete LDA subclass for this assumption varies depending on 
the values of a. Without losing too much generality, the Theorem assumes that (px is differentiable. 

Theorem 8 Let x^(xi,X2, ...,Xm) be any LDA, and a E M*. Then: 

• X* is homogeneous of degree a ^ 1 if and only if it is a Cobb-Douglas; 

• Xj. is homogeneous of degree a = 1 if and only if it is a Cobb-Douglas or a CES. 
(proof: see the Appendix). 
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Optimality 
(LDA) 


Th. El 


Completeness 
Th. El Th. [71 Th. El 

(a /I) 


Th. El 

(a = l) 




Y 


Y 


Y 


N 


N 


Y 


Cobb-Douglas 


Y 


N 


N 


Y 


Y 


Y 


Leontief 


L 


L 


L 


N 


N 


L 


MST 


N 


N 


N 


N 


N 


N 



Table 2: Summary of our results on four families of aggregators: CES, Cobb-Douglas, Leontief and 
MST, with respect to the assumptions made in Theorems El El H and El (see text for details). 

4 Discussion 

Families of economic aggregators Table [2] summarizes the results obtained on three families of 
aggregators: CES, Cobb-Douglas and Leontief. For each of them we give the indication of whether 
they are LDAs (Y/N), whether they can be in the limit (L), and whether they become complete for 
the assumptions made in Theorems El El Hand El (Y / N / L). Remark that the Table makes a clear 
distinction between all these three families of aggregators. There exists various other aggregators in 
economic works; for obvious space reasons, we have chosen to focus on the most popular, and it turns 
out that all have strong relationships with LDAs, either directly, or at the limit. In order to cover the 
possible relationships between aggregators and LDAs, let us take a last example, of a general class of 
aggregators that we call Mitscherlich-Spillman-von Thiinen (MST) aggregators [141 118^ [T9]. a family 
in which the global form of aggregator x^, reduces directly or after a variable change to {6 € {— 1,-|-1}): 

m 

= J] (1 - eMOl^x^)) . (26) 

i=l 

Such aggregators date back to the XIX*'' century, and so they have preceded those we have been 
focusing on so far. What we can show is that, contrasting with their successors, MST aggregators are 
not LDAs. 

Lemma 4 MST aggregators are not LDAs. 
(proof: see the Appendix). 

Aggregators and economic constraints Modulo changes of variables. Theorems (l5|) - ([8]) could 
be alleviated from the constraint of the LDA choice under their respective economic assumptions. 
Consider for example (I2ip . in which we would like to plug any LDA. To be concrete, let us stick 
to prices and consumptions in (jl2p . Consider the generator cpc of some strictly concave LDA 
that aggregates its consumptions Cj for i = 1,2, ...,m. Let us say that strict concavity is chosen 
because usual consumption indexes are concave, to indicate the consumer's preference for diversity 
[9]. Consider the change of variables that involves some (concave) CES: 

(0ELs)"(4^^(c.)) • (27) 

If we consider that consumptions q are the actual observed consumptions (the Cj's being "hidden", 
non-observed consumptions), the aggregator c^, for those lifted consumptions is a CES which may be 
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Figure 1: Prices vs consumptions for a concave consumption LDA in (|30p: regardless of the LDA, for 
any good i, if its price is larger (resp. smaller) than the price index, its consumption cannot be larger 
(resp. smaller) than the normalized consumption index (see text for details). 

plugged in ([12]) • Reconstructing c^, is immediate as we have: 

^py'i^iu^*)) ■ (28) 

In order not to laden the discussion, let us consider that all weights f3i = 1/m, and that we keep the 
CES form for (pi = Pi). What interests us here is not exactly the consumption index (c^), but the 

normalized index, =^ c-^/m, which is really homogeneous to the consumption of a single good. We 
also define the same index for c^, and assume the same relationship as ()28p for these two indices. 
Differentiating (fT2]) in Cj yields: 



Pi _ ^c^si^i) 



[2] 

(29) 



from which we obtain using (p7|) and ([28|) : 

Pi _ (t>f{ci) , . 

P* (As (c*) 

Eq. (j30p is interesting because it displays a remarkably stable behavior that holds for any concave 

[21 

LDA. Because of Theorem |4] and the convexity of (j), 4>^ is monotonic decreasing and strictly positive. 
Thus, it converges towards some non negative value. Figure [1] gives an overview of how prices and 
consumptions always behave. There are two conclusions to draw from the figure. The first is a sanity 
check, as larger prices mean lower consumptions, a conclusion that follows from dividing ()30p for 
distinct goods i and j. Indeed, if a good i has price pi > pj, then we shall have Cj < Cj (notice, 
from ()27p . that it is equivalent to saying Cj < cj). We also remark that prices that are larger (resp. 
smaller) than the price index mean consumptions that are smaller (resp. larger) than the normalized 
consumption index (see Figured]). The second is the dampening effect of prices on consumptions: a 
small difference on prices may incur a very large difference on consumptions if those prices are already 
small, and it can make almost no difference on consumptions if prices are high. 
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1l > li 



Figure 2: Intervals for which the marginal rate of substitution of Xi for Xj (or xi) exceeds unit when 
x^ is a concave (left) or convex (right) LDA. 

Global behaviors of LDAs as economic aggregators Even without a change of variables, LDAs 
sometimes display economic regimes with extremely close behaviors, as witnessed by the marginal rate 
of substitution of Xi for Xj. Indeed, whenever IS On LDA, we have: 



Figure [2] displays the general behaviors of s!^^~*^- as a function of the concavity or convexity of x^. In 

the convex case. Theorem 2] and the convexity of bring that (j)x is monotonic increasing and strictly 
positive, hence the schema depicted in Figure [2] (right). The dashed rectangles depict the intervals for 
which this marginal rate of substitution would be greater than 1, that is, locations where we would 
be willing to trade more than one unit of x^ to obtain one unit of Xj. The behavior is remarkably 
linked with the global regime of a;*: when it is concave, Figure [2] (left) clearly displays a preference 
for diversity, while when it is convex, Figure [2] (right) shows the symmetric trend, an aversion for 
diversity. 

Aggregators of aggregators and economic programs LDAs may incorporate heterogeneous 
quantities and even LDAs as well, as it is common for economic aggregators to integrate other economic 
aggregators. Examples show how the whole aggregate may behave, and how global regimes underlined 
above for "baseline" aggregators also emerge in a simple manner for whole aggregates as well. Consider 
the determination of the global consumption index and money expenses m (hereafter, m does not 
refer anymore to the number of aggregated values) of a consumer, based on his/her whole budget r 
and price indices Pi, (we do not make any assumption on the form of and p^). The consumer solves 
the maximization of a utility aggregator n^: 





(31) 



maxii^ s.t. p*c^ + m = r . 



(32) 
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Let us investigate the general solution of ([32]) , under the sole assumption that is some concave LDA 
that mixes consumption and money via some Bernoulli distribution -6(7) for coefficients 7j,i = 1,2 
which leverages the importance of consumption and money in u^: 

^* = V^^^(7V^„(c.) + (l-7)V<^„(m/p,)) . (33) 

The following Theorem states the fundamental relationships that may be used to determine and 
m, for any concave LDA. 

Theorem 9 The optimal values for and m in (Wfl satisfy: 




(proof: see the Appendix). Because 4>u is monotonous decreasing and strictly positive, solving ([3l|) 
and (I35p can be done via a simple dichotomic search in the general case where (/I'^l (x) would be strictly 
monotonous (when strict monotonicity is not ensured, we may logically end up with an interval of 
values). The proof of Theorem [9] reveals an interesting relationship between consumption and money, 
namely: 

= (^) . (38) 

Figure [3] displays this relationship, and more precisely where m/p^ is located with respect to c*, 
depending on 7. Remark that when 7 > 1/2, which marks the predominance of consumption over 
money in the consumers' utility function n^. Figure [3] shows that indeed represents more than 
money in the whole budget, as we recall that + {m/p^) = r/p^,. The symmetric situation holds 
when 7 < 1/2. 

5 Conclusion 

The aim of this paper was to demonstrate that the choice of various economic aggregators, that have 
mainly been originally built on empirical economic grounds, meet optimality from the information 
standpoint, and most notably, remain the only optimal aggregators as various standard economic 
assumptions are considered. In these settings, they are the only ways one could summarize at best 
numerous individual variables in aggregate indexes, designed to represent their collective economic 
behavior at the global level. We have also displayed the fact that information aggregators meet 
consistent economic behaviors for fairly general settings, and that these behaviors may be extracted 
using simple derivations. 

Our results might have applications, and implications, on various economic areas whose theoretical 
and empirical results depend on the use of functions embodied in the LDAs studied in this paper. 
This includes, for instance, theories such as imperfect competition, international trade and growth 
theory. LDAs might make it possible to transfer, in the economic frameworks, three main advantages 
that Bregman divergences in general have extensively brought to statistics, classification and even 
geometry [H [151 IS]- First, the analytical expression of a LDA can be extremely complex, yet the 
abstraction of the general form ^ , and its properties (Theorems El H]) , makes it possible for fairly 
complex behaviors to be derived in a simple manner. Second, it has been shown in the aforementioned 
fields that algorithms for solving different problems on a single divergence could be generalized to 
solving the same problems on any divergence. Last, but not least, LDAs encode such a large variety 
of functions that they could make it easier to find tight fits to economic data. 
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Figure 3: Depiction of the relationships between consumption and money for the consumer's program 
([32]) (when 7 > 1/2 and 7 < 1/2; see text for details). 
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6 Appendix 

6.1 Proof of Theorem [1] 

Theorem 4 in [3] states the result for a function that meets [1], [3] and F{x,x) = 0,Va; € M'^ (the 
<^ in [2]). F also meets identity of indiscernibles as otherwise indeed, ^ would be violated for any 
distribution with distinct x and y as support and such that F{x, y) = 0. In this case, the solution of 
the left-hand side of ([5]) would be x or y, but not their average. 

6.2 Proof of Theorem H 

Because of its independent interest for Bregman divergences, we state the proof in the most general 
form: we do not make the assumption that the generator is separable (|4]). Without loss of generality, 
we assume F = 1 in ([8]). The concavity of //<^ means: 

E,-V-HE,V^(a;,,-)) < V-HE,V^(E,a^i,-)) ■ (37) 

Let Xij — ^^^(x'-) for x'- G im(V(p). Applying V,^ on both sides (99 is strictly convex, so V<^ is 
bijective) and replacing yields: 

V^(E,V-i(E,a;^,)) < EiV^{E,V-Hx[M . (38) 
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Eq. ([38]) states the convexity of the LDA V^(Ex)V(^^(X)), but Legendre duahty imphes 

= V~* , and we get /i = /x<^*, the dual of LDA /i<^. The proof starting from the convexity of fi^ 
follows the same path. 

6.3 Proof of Theorem H 

We prove the first part of the Theorem relating fi^ and s. Without loss of generality and to save 
notations, we make the proof for separable generators (jH) and take a route slightly different from a 
direct use of Jensen's inequality. Furthermore, we make the proof for the concave case, assuming ([8]) 
holds for fx^p. Division by T does not change the regime, and so //<^/r = V^^ (YlT^i (7«/r)V(^(xj)) is 
concave. We compute the tangential hyperplane to ^(^/T when Xi = x £ int(dom(i;^)), Vi = 1,2, ...,m. 
We know that since fJ,,p/T is concave, it shall be located below this hyperplane. We have V^_^/p = 
[• • • (7j/r)0Pl(a;j)/(/>[^](/i^) • • -J^, and so the tangential hyperplane to a; = [• • • x • • •]"'' on any x is (with 

def 

Ki = 7i/r): 

z{x) = fi^{x) + {x-x,V^^/-pix)) 
= X + {k, x) — X 



(K,a3) = ^(7,/r)xi . (39) 



We obtain /Ui^/T < X^I^i This yields to the statement that /i,^ < s, as claimed. The proof 

for the convex case is similar. The second part of the Theorem is an immediate consequence of the 
first part, and so this ends the proof of Theorem [H 

6.4 Proof of Lemma [1] 

We have (Aces(^) = "(^^ " l)x(''"^)/'^/o- and ((/>ces) ^ (^) = (o"2;/(a(2cr - l)))<^/(<^-i) . There remains 



to remark that B = Yl'iLi A = B^, use ^ and get: 



/^0CES 



1 ^ 1 ^\--' 

^^B—^/3,x^^ , (40) 



which, after simplification, gives (jlOp . as claimed. 



6.5 Proof of Theorem [5] 



Without loss of generality, we first show that, if is a LDA that meets ()2ip . it is a CES. We thus 
assume the form ([8]) for x^,. Using ([8|), we obtain that x^, satisfies (we replace in this proof notation 
4>x by the simpler (f) for the sake of readability): 

^^Ht) = fE^^^'^Hx.) , (41) 

i=l 

with 7i > 0,Vi = l,2,...,m, and T =^ Yl'iLiU- If we differentiate (fT2]) with respect to any Xi, using 
dSD, we get: 

Zi_ ^ 7i(p^^\xi) 

0[2i(x./r) ■ ^ ^ 
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We multiply both sides by Xi/{TXi,), sum for all i, simplify via (jl2p . rearrange, and get: 

^ m 

t^'"(t) = fEW^kx.). (43) 

i=l 

Now, we match (|4ip with (j43p . and get that (/> musi satisfy: 

3k; G s.t. = KX(/)t^](x),V2; G dom(/) . (44) 

The solution is found to be (p^^^x) oc x'^, i.e.: 

0(x) = , (45) 

with (i G M* any constant that keeps (j45p convex. Matching (j45p with (jlip implies cr = 1/(1 — k), and 
we get the proof that is a CES. 

Lemma [2] then implies that is also a CES. The proof that (p^z'^ = d (0'^') ^ for some d G 
follows from the expressions of (pc and (pp in Lemmata [1] and [21 

6.6 Proof of Theorem [6] 

It is well-known that the property is true for any CES, so we investigate the reverse implication, and 
given any LDA x*, let (j) (which replaces (px for the sake of readability) denote its generator. Using 
([8]), we obtain: 

^ JjXjCP^^Hxi) 

x,cpm{xjT) ■ ^^^^ 

Summing this for i = 1,2, ...,m and rearranging, we get that the sum of elasticities of is one implies: 

^ m 

= ^EW^k^.). (47) 

j=i 

This is exactly (j43p . There remains to conclude that is a CES as in the proof of Theorem [5j 

6 . 7 Proof of Theorem [7] 

Implication <^= is folklore, so we investigate the reverse implication. For any LDA x^ whose generator 
is denoted (p, Exl~^^^ = 1 implies, with Xij =^ Xj/xf. 



(48) 



implying for some k > which does not depend on Xi or xj . We obtain that x* satisfies 

the following PDE: 



f _ [ ^ 

'\dxij ^ \dxj 



. (49) 



Because LDA, we have dx^/dxi = 7j(/)[^] (xj)/(/)[^l (x^), and so (|49p becomes: 

7jXj0[^l(xj) = KjjXjCp^'^^Xj) . (50) 
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Since > and ([50]) holds for any Xi,Xj G mt(dom(/)[^l), we obtain that a;0[^l(x) is constant. 

This yields (j){x) = b{xlogx — x) for some constant b G the generator of Cobb-Douglas LDA 

([T8|), as claimed. 

Remark: (|50p also proves that if m > 2 and we require unit substitution elasticity between more 
than two goods, then necessarily k = 1 and ji = 'jj, = 1,2, ...,m. By means of words, fixing unit 
substitution elasticity for more than two goods implies equal weights for the goods. Clearly, this is a 
property of Cobb-Douglas aggregator rather than a restriction for LDAs. 



6.8 Proof of Theorem [8] 

Implication <^= is folklore, so we investigate the reverse implication. For any LDA whose generator 
is denoted (p, ([25]) implies: 



Take some Xi, i = 1,2, ...,m, and differentiate both sides in Xj. We get after simplification: 



(52) 



Case 1: a ^ 1. Suppose that Xj = Xj = z/X,\/i,j = 1,2, ...,m, which implies x^ = z/X as well. 
Eq. ([52]) simplifies to: 

cP^Hx^-'z) = (53) 

i.e. is homogeneous of degree —1. Euler's homogeneous function Theorem implies that </)[^] 
satisfies the following PDE: 

x0[^l(x) + (/)[2](x) = , (54) 

whose solution is (p^'^'^^x) oc 1/x (with a positive factor), i.e. (/>(x) = 6(xlogx — x) for some 
constant b £ M+,*, the generator of Cobb-Douglas LDA ([T8]) . as claimed. 



Case 2: a = 1. In this case, (I52p implies: 

<A[2](Ax) = g{XH^^\x) , (55) 

for any function g{X) G M*. Suppose without loss of generality that g is Ci, so that we can take 
the route of the proof of Euler's homogeneous function Theorem. We differentiate ([55p in A, and 
take the resulting equation for A = 1. We obtain the following PDE: 

x0[3](a.) _g[i](l)^[2](^) = , (56) 

i.e. 4>^'^^{x) oc x**, where k G is some constant. We obtain that (f> is either of the form of (|lip . 
or ([Is]) , the generators of CES and Cobb-Douglas LDAs, as claimed. 
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6.9 Proof of Lemma H] 

After differentiation on some Xi, should it be a LDA, any MST aggregator x^, with generator would 
satisfy: 



7i X 



-eeMOl^x.) ^ ,;x^P](..)x-^, (57) 



(1 - exp(07i^,)) n ^ , ^[2](^ 



with 7- the LDA weight for Xj. This would imply (p^'^^x) = 1/x, from which the simplification 
of ([57|) yields that regardless of the value of Xi, the corresponding weights ji and 7^' must satisfy 
— 6*74X4 exp(07jXj) = 7,-(l — exp(07jXj)), impossible. 

6.10 Proof of Theorem H 

Let A be the Lagrange multiplier for (j32p . so that the Lagrangian is L =^ + A(r — p^c^, — m), and 
we obtain the following stationarity conditions for the optimum: 

dL 70«'(cvc) , „ .^g. 

aL ^ (l-7),^L'J(m/p,) 

Solving (I59p for A and simplifying (I58p yields: 



A = . (59) 



^i^'(c.) = ^01^1 (p^) . (60) 



There remains to use the identity j?*c* + m = r to get (pl|) and ([35]) . as claimed. 
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